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(S7\ A technique, specif icafly apparatus and an 
accompanying method, for accurately thresholding an 
image based on local image properties, specif cany lumi- 
nance variations, and particularly such a technique that 
uses multi-windowing tor providing enhanced immunity 
to image noise and lessened boundary artifacts. Specif- 
ically a localized intensity gradient. G(ij). is determined 
for a pre-defined window (300) centered about each 
image pixel(ij). Localized minimum and maximum pixel 
Intensity measures. and L^ , respects y .are also 
determined for another, though larger, window (330) cen- 
tered about pixel(ij). Also, a localized area gradient 
measure. GS(ij). is determined as a sum of .'^ua 
intensity gradients for a matm of pixel positions 370) 
centered about pixel position (i j). Each image pixel(i j) is 
then classified as being an object pixel, i.e.. black, or a 
background pixel, i.e.. white, based upon its area gradi- 
ent GS(ij), and associated L^m and ^ measures. 
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Description 

TPrMNIHAL FIE ' D QE INVENTION 



The invention relates to a technique, specifically apparatus and an accompanying method, for accurately thresfv 
olding an image based on local image properties, specifically luminance variations, and particularly one that uses multi- 
windowing lor providing enhanced immunity to image noise and lessened boundary artifacts. 

RAr.knRmiND ART 

With digital image processing and digital communication becoming increasingly prevalent today, increasing amounts 
of printed or other textual documents are being scanned for subsequent computerized processing of one form or another 
and/or digital transmission. This processing may involve, for example, optical character recognition, for converting printed 
characters, whether machine printed or handwritten, from scanned bit-mapped form into an appropriate character set, 
such as ASCII, the latter being more suitable for use with word processing and similar computerized document-process- 

Scanning a gray-scale document typically yields a multi-bit, typically eight-bit, value for each pixel in the scanned 
document. The value represents the luminance, in terms of a 256-level gray scale, of a pixel at a corresponding point 
in the document. These pixels are generated, depending upon the resolution of the scanner, frequently at resolutions 
of 200-400 pixels/inch (approximately 80-1 60 pixels/cm), though with highly detailed images at upwards of 1 200 or more 
pixels/inch (approximately 470 pixels/cm). Consequently, a scanned 8 1 12 by 11 inch (approximately 22 by 28 cm) image 
will contain a considerable amount of gray-scale data. Inasmuch scanned text generally presents a written or printed 
characters of some sort against a contrasting colored background, typically white or black print against a white or light 
colored background, or vice versa, the exact luminance value at any one pixel in the text is not as important as whether 
that pixel is either part of a character or the background. Therefore, scanned textual images, or scanned textual portions 
of larger images containing both text and graphics, can be efficiently represented by single-bit pixels in which each pixel 
in a scanned image is simply set to. e.g.. a "one" if that pixel in the original image is part of a character or part of 
foreground information, or to. e.g.. a "zero" H that pixel in the original image is part of the image background. To easily 
distinguish the different types of scanned images, a gray-level image is defined as one having multi-bit (hence multi- 
value) pixels, whereas a binary (or bi-level) image is formed of single-bit pixels. Furthermore, since binary images gen- t 
erate considerably less data for a given textual image, such as, e.g.. one-eighth, as much as for an eight-bit gray-scale 
rendering of the same image, binary images are more efficient over corresponding gray-scale images and thus preferred 
for storage and communication of textual images. Binary images are also preferred because of their easy compressibility 
using standard compression techniques. e.g.. CCITT Groups 3 or 4 compression standards. 

Gray-scale images are converted to binary images through a so-called thresholding process. In essence/each multi- 
bit pixel value in a gray-scale scanned image is compared to a pre-defined threshold value, which may be fixed, variable 
or even adaptively variable, to yield a single corresponding output bit. If the multi-bit pixel value equals or exceeds the 
threshold value for that particular pixel, the resultant single-bit output pixel is set to a "one"; otherwise if the threshold is 
greater than the multi-bit pixel, then the resultant single-bit output pixel remains at "zero". In this manner, thresholding 
extracts those pixels, such as those which form characters, or other desired objects, from the background in a scanned 
gray-scale image, with the pixels that form each character, or object, being one value, typically that for black, and the 
pixels for the background all being another value, typically that for white. For ease of reference, we will hereinafter 
collectively refer to each character or other desired object in the image as simply an "object". 

Ideally the best thresholding process is one which accurately selects all the object pixels, but nothing more, in the 
scanned image and maps those pixels to a common single-bit value, such as, e.g.. "one" for black. In practice, noise, 
background shading, lighting non-uniformities in a scanning process and other such phenomena, preclude the use of a 
single fixed threshold for an entire image. In that regard, if the threshold is too low. the resulting image may contain an 
excessive amount of noise in certain, if not all regions; or. H too high, insufficient image detail, again in certain, if not all, 
regions - thereby complicating the subsequent processing of this image. Given this, the art recognizes that a preferred 
approach would be to select a different threshold value that is appropriate to each and every pixel in the scanned image. 
In doing so the proper threshold value is determined based upon local properties of the image, i.e., certain image 
characteristics that occur in a localized image region for that pixel. Hence, the threshold would vary across the image, 
possibly even adapt to changing localized image conditions. 

In general a common methodology for variable thresholding relies on measuring localized image characteristics, 
such as local intensity contrast (or gradient), local averaged intensity and/or local variance, in a local window centered 
about a pixel of interest and then using these measures to classify image pixels into either object pixels, black, or back- 
ground pixels, white. Here, too. reality diverges from the ideal inasmuch this methodology is complicated, and often 
frustrated by a need to extract various objects in a wide range of documents but with minimal user intervention, such 
as for purposes of initialization and object identification, and while still yielding a clean background in the thresholded 
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ohiects mav include e g., dim. broken objects; and objects that present a relatively low contrast, 
image. In realrty these gray embedd ed in a black background, 

such as white -n m^'ina^ d fixed thresholding to provide adequate performance w«h actual 

Nev tn! ! rX thresho,din 9 approaches thal at,empt ^T? 6 .f^SZ 

,mages. the art has persistec i oy teac g drawbacks that, in practice, tend to l.mrt their ut, My. 

performance. TE£££™** ol drfferent localized image properties, are taught in M. Karnel * 

Venous ^^^^^SS^k^ from Grayscale Document Images". r.VGIP- Graphical Models artf 
■L. "Extraction of B.na y < Charad 9 / H a ical lever ^oach is based on companng a 

^Processing . Vol. 55 ^Na *W™r> raXe (if the image is noisy) with four local averages in neighborhoods 
gray level of « 9'vervp«el ^^X^S^tiS^ A « •« gray level of the given pixel is sufficiently below 
centered about tour p.xels orthogonally su ™"™^ " * a h aDDroa ch so-called "mask-based subtraction", relies 
all tour .oca. averages then the First most of 

on considenng every pixel . a™*ge as a »« a 9 » 1unctioning t0 remove -p artic ,e- noise. The 

the background pixels are detected us W**Q***^ ^ each giyen pixel ^ 5lopes 

tirterisapp.iedtofourp.xe.seq^ 9 ch P racter/graphics pixels which are black and 

of 0. n/4. n/2 and 3n/4. The »«^"9 ZTSLm or "mask" image is modified by detecting additional back- 

background pixels which are ^^^^^J^^S^ character/gUics pixel, the gray .eve. 
ground pixels us.ng a P^**™^ pixels. Lastly, a gray-scale character/graph- 

of its background image .s est mated ^''^^f^^^^g from P the origina | scanned image with resulting 

differences then ibeni ^^^"~ a dHference between this threshold and a varying background value essent,ally 
value .tse« ^JSi^SSS - «rst blush, these two approaches would appear to be *om^ ^rnune 
Iodise* « ract^Ceac^ese approaches is high.y sensitive to noise and often resuHs in a no,sy bac*ground when 

extracting dim and/or broken a Jjj£"^ 4 ^ 670 (issue d to R.R.A. Morton et al on September 

Another approach, asdescnbed .n Unrted ^« J££V -*J^ 7 ° i^ Qround value in an image, with a threshold 
19 . 198 9 and Here, whenever a transition 

value being a sum * 

ties in detecting low contrast objects. t a| Qn ^ ^ 28 n ^ 

A further approach ^ descried . n Un^ ^ a ^ HWW 

Here. adaptive thresholding "J^mrMd £*n£ Thfeoffset potential te used in conjunction 

basis as afunc^ono whrte^ 

with nearest "W^V^^™ "JJ "^^^g the ima ge potential of that pixel with predetermined mrnrmum 
^'^STE ^^ZV^^- Unsafely, this techn^ue also appears to exhibit drfficuK.es ,n 

extracting low contrast ^J^^^ specifically apparatus and an accompanying method, for accu- 
Therefore. a need e «ste >n th » , art Mtor a tecnnqu pe y k threshold that varies based on local 

ra ,ely and rejab.y immunity and reduced bo^ary artifacts, as we,, as 

Ss^rac^ V^Z «ow contrast objects, than do conventional viable thresholds techn^es. 

p,o^, ^c. irf OF THE INVENT ION 

We have substantially and advantageous* overcome the def iciencies in the art through our invents multi-window 

thresholding technique. Wm «8k, ^rori.ont nfi n is determined lor each pixelfi.j) in an incoming 

scanned gray-scale M-mapped imace. i r»s orao,e i delected to «i> N-byN pixel window 

about pixeKIJ). Next. ntinanum tnterawy. ma»» «uM « , 6um „, ^kMrt intensity 

gradients, is ascertained tor a mauiMwmouw; u h ^ backoround oxel i.e.. white, primarily based 

lean M * X'SSrS ?SS*WJ^ll SC^Si ^ese .h-eep.^ ate 
upon its area gradient. GS(i.j). ana > seconaaniy u P ol > » « hpin^centered in various windows, one for determining 

s^^rSiS^^ 
gradient GS(i j). 
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Using these local measures, each image pixel(ij) is first classified as being near an edge or not. In that regard, each 
nixel near an edge is so classified if the local measures therefor, specifically its associated area gradient, exceeds a 
ore-defined threshold value. Otherwise, that pixel is viewed as falling within a locally flat field, i.e., of relatively constant 
orav-scale Specifically, edge pixels are classified by comparing their intensity values against average intensity values 
for local N-by-N pixel windows therefor; in contrast, pixels in flat fields are classified by comparing their intensity against 
a pre-determined intensity value. 

In particular, an image pixet(ij) located in a vicinity of an edge in the image is detected whenever the area gradient, 
GS(i D for that pixel is high, particularly being larger than a predefined gradient threshold, GT. Once such a pixel is 
found each pixel on a darker side of the edge is found. This latter pixel being in the center of a local N-by-N. e.g.. 7-by- 
w 7 pixel window is classified as an object pixel, i.e., black, whenever its intensity. Lc, is smaller than an average of its 
associated L min and values. Alternatively, it the intensity of this latter pixel, Lc, exceeds the average intensity value 
of L and lT!" then this particular pixel is classified as being background, i.e., white. 

Where ttearea gradient, GS(ij). for pixel(ij) is less than the gradient threshold, such as where the N-by-N window 
slides along an image region of relatively constant gray-scale, pixel(ij) is classified by simple thresholding, i.e.. thresh- 
olding its gray-scale value against a constant predefined threshold value, IT. Here, if the gray-scale value of pixel(i.j) is 
less than the threshold value IT. this pixel is classified as an object pixel, i.e., black; otherwise, this pixel is designated 
as part of the background, i.e.. white. 

To enhance image edges and by doing so further increase thresholding accuracy, the intensity values of those pixels 
near an edge are modified. For the preferred embodiment, black pixels are illustratively represented by a value of zero 
90 and white pixels by a value of "1 ". Specifically, for those pixels located on a darker side of an edge and having an intensity 
value that exceeds the threshold value. IT, the intensity of each of these pixels is reduced to a value slightly below 
threshold IT. e g . to a value IT-. In contrast, the intensity values for those pixels, located on a brighter side of an edge 
and having an intensity value less than threshold value IT. are increased to an intensity value slightly greater than thresh- 
old value IT. e.g.. to value IT+. A high quality binary image can then be produced by merely thresholding the resulting 
75 gray-scale irnage against pre-defined threshold value IT. 

Advantageously, through the use of area (summed intensity) gradients, the present invention signrficantly reduces 
both background noise and boundary artifact generation while accurately extracting objects in a thresholded image. The 
modified gray-scale image, when subsequently thresholded using a fixed threshold value IT. greatly enhances detection 
of low contrast objects and thus readily produces a high quality binary image. 

30 RRIPF DESCF iPTiriM OF THE DRAWINGS 

The teachings of the present invention may be readily understood by considering the following detailed description 
in conjunction with the accompanying drawings, in which: 

35 FIG. 1 depicts a high-le/el block diagram of document imaging system 5 that embodies the teachings of our present 
invention; 



to 



FIG. 2 depicts the correct alignment of the drawing sheets for FIGs. 2A and 2B; 

FIGs. 2A and 2B collectively depict a high-level flowchart of our inventive multi-windowing thresholding method; 

FIG. 3A depicts a 3-by-3 matrix of pixel location definitions which we use in determining, through a "Sober gradient 
operator, gradient strength for current pixel(ij); 

FIG. 3B depicts a graphical representation of an N-by-N neighborhood of pixels centered about a current pixel(i j) 
in an image intensity record; 

FIG. 3C depicts a graphical representation of an (N-2)-by-(N-2) neighborhood of pixels centered about pixel position 
so (i.j) in an image intensity gradient record; 

FIG. 4 graphically depicts intensity profile 400 of an illustrative scan line of an original gray-scale image, such as 
that illustratively on document 10 shown in FIG. 1 ; 

js FIG. 5 graphically depicts a modified intensity profile which results after thresholding intensity profile 400 shown in 
FIG! 4 according to the teachings of our present invention; 

FIG. 6 depicts a block diagram of a preferred embodiment of our present invention; 
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FIG. 7 depicts a block diagram of Sum of Gradients Circuit 620 which forms part of circuit 600 shown in FIG. 6; and 

FIG. 8 depicts a block diagram of 7-by-7 Maximum and Minimum Detector 635 which also forms part of circuit 600 
shown in FIG. 6. 

To facilitate understanding, identical reference numerals have been used, where possible, to denote identical ele- 
ments that are common to various figures. 

MQBES OF CAP RVN^ OUT THE INVENTION 

After reading the following description, those skilled in the art will quickly realize that our invention is applicable tor 
- » 2 n^riv an? diument imaging system tor accurately thresholding scanned documentary .mages containing line 
ad '^nlSSSSSS that contains and the media upon which the image originally appears. For = .e 

ne we wnTdiscuss our invention in the context of use in ****** scanned character data, such 

Z : i«tuai letters and numbers, which we will collectively refer to hereinafter as "objects . 

as textual letters ana nu . w black and white grav . sca , e pixels as having e.ght-brt gray- 

For P^poses of * Fof consis1ency , we also de »ine output binary pixels with black being "0" and 

££22 that Ampere pixel intensities, as would be readily apparent to those skilled ,n the art. reversed from those 

^G^DictTattah-level block diagram of document imaging system 5 that embodies the teachings of our 
inJS, A?sSwn system 5 is formed of gray-scale scanner 20. digital image processor 30 and Simple" thresho.d.ng 
,nve ^'°"- ^^ton document 10 containing various objects of interest is scanned by scanner 20 to produce muft- 
Z^^ST^^S Q^cale image date. L This data is routed, via leads 25. to a data input of digite. 
±™ S£es2l which in turn and to the extent relevant here, thresholds the scanned image date and generates both 

on leads 37 and an output binary image on leads 33. The modified fl ray-sca»e .rnaga as 
I SfniaS detail below has its edges enhanced by processor 30 to subsequently increase threshold.ng accu- 

£f ™ leads 37 * rOUted t0 *" inpUt 01 threSholdin S drCUit 4 i Wh ^ 
racy. The modified gray scaie image «* w a Q „ air «, a nrc-defined fixed threshold value thereby 
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Pix^. inThe^S^e date against a preyed « va*e mere* 

^SS^ ^Sled-simple" thresholding. The resulting thresholded. i.e.. binary, image produced by circuit 40 « applied, 
proyidingso-called simple i processor 30 performs thresholding in accordance with our 

V ' a ^rienTo T£ ^ZS^^S^ requLtor processing bit-mapped images, processor 30 is prefer- 
S ai <E5£ hardware circuSy inducing our inventive circuit 600 shown in FIG. 6 and discussed ,n 
S£i^S^te^thSy this processor can be implemented through one or more general purpose miaoprocessors. 

IssSted men^rVand supporting circuitry and suitably programmed to implement our present .nvenfon m soft- 
ware thatTe" micropr^ess^s) could execute instructions sufficiently fast to produce the requisite date 

,hr ° A Sieve, flowchart of our inventive multi-widowing thresholding method 200 is college* depicted I in FIGs. 2A 
and 2B Tr wN* the correct alignment of the drawing sheets tor these f igures is shown .n FIG. 2. During the course of 
ana reference will also be made to FIGs. 3A-3C where appropriate. 

^SSiSfS Peno meS I to J5. £ every grayscale pixel in an incoming image, thereby iterating through the 
the n^nne through which each grayscale image pixel is processed is identical across all such pixels, we 
TSrt v dfecussThfe routine in the context of processing generalized pixel(i.j). where i and j are potftve integers and 
tiSSX^S^ indices within the incoming grayscale image. In addrtion to grayscale image date. 
t us7r ZrtZ values of two input parameters, specifically fixed thresholds IT and GT. These thresholds are adjusted, 
e XcMbS be^ow 7rther to provide satisfactory thresholding across a multitude of diHering, mages with varying char- 
^SSIuch 7s contrast and illumination, or can be adjusted to provide optimum thresholding for any one type of 
fmlofaS/o^ ^SLnning^ondition. In any event as a result of performing method 200 on all such grayscale image pixe£ 
LTesh^ed SnaTy o' modified grayscale image will be generated, with each pixel in the resultmg image 
I Z £2 in me image. Atthough not shown in FIGs. 2A and 2B. H a modified gray-scale image is o be Produced, 
n^mao tSSSSSrtli in a "sirnple" fashion using a fixed, pre-defined threshold, having a value IT, as that value 
!! ^dS belcl ^The modified grayscale image can then be further processed, e.g.. through scaling or filtenng. as 

^foarticular upon starting method 200. step 203 is firsl performed. This step determines a S^adiem strengthfor 
ni«^rThte fe accomplished by use of the so-called "Sober gradient operator on a window of prxete centereo^bout 

SetediniuS 



5 



EP 0 712 094 A2 

GX(ij) and GY(i,j), respectively, and for each pixel position (ij) forming the gradient strength, G(i j) as an absolute sum 
of GX(i.j)andGY(ij): 

GX(i.j) = L(i+1 j-1) + 2L(i+1 j) + L(i+1 j+1) - L(i-1 j-1) - 2L(»-1.j) - L(i-1 j+1) (i) 
GY(ij) = L(i-1 j+1) + 2L(i.j+1) + L(i+1,j+1) - L(i-1 j-1) - 2L(i,j-1) - L(i+1 (2) 
G(ij) = |GX(i f D|+|GY(ij)| (3) 

where: G(i j) is the gradient strength at pixel position 
(M):and 

L(i.j) is the image intensity, luminance, at pixel 

The res^g^radient strength value for every image pixel collectively forms a gradient strength record for the entire 

scanned image. , ' M „ ■ ; , 

Once the gradient strength, G(i,j), is determined for pixel(i,j*). method 200 advances to step 206. This step, when 
performed determines the area gradient for pixel position (i.j), i.e. GS(ij). as the sum of the gradient strengths for each 
and every pixel position within an (N-2)-by-(N-2) window centered about pixel position (i.j). This window is illustratively 
5-by-5 pixels in size and is shown, as window 370, in FIG. 3C for an N-by-N, illustratively 7-by-7. window shown in FIG. 
3B the latter being window 330, centered about pixel(ij). For purposes of comparison, the periphery of an N-by-N 
window is shown as window 350 in FIG. 3C. Though shown sequentially to simplify illustration, step 245 is generally 
performed at essentially the same time, as symbolized by dashed line 244, as is steps 203 and 206. Step 244 determines 
both minimum and maximum pixel intensity values, L™ and L^. respectively, in an N-by-N window centered about 

PIXel Once the area gradient is determined for pixel position (i j). then, through the remainder of method 200, the three 
image measures. GS(i.j) and and L^. associated with this particular pixel position are used to classify pixelfij) 
as an object pixel, i.e.. black, or a background pixel, i.e.. white. 

In particular, decision step 208 is performed to assess whether the value of the area gradient for pixel(ij). i.e.. GS(i.j). 
exceeds a pre-defined threshold value, GT, or not. This test determines whether pixel(ij) lies in a vicinity of an edge in 
the scanned image, or not. If the area gradient is less than the threshold value, GT. then pixel(i.j) does not lie near an 
edoe Hence pixel(i j) lies within a localized relatively constant tone. i.e.. "flat" field, region of the scanned 9 r*y-*wile 
image In this case, method 200 advances along NOpath211 emanating from decision block 208 to decision block 21 5. 
This latter decision block determines whether the gray-scale intensity of pixel(ij). i.e.. L^. exceeds another pre-def ined 
threshold value. IT. If this gray-scale value is less than the threshold, the pixel is classified as an object pixel. i.e., here 
black' otherwise the pixel is classified as a background pixel, i.e., here white. Specifically, on the one hand, if pixel 
intensity L- is less than or equal to threshold IT, decision block 215 advances method 200, along NO path 217, to 
decisionblock 222. This latter decision block determines whether a user has previously instructed the method to produce 
a binary or gray-scale output pixel. 11 a binary pixel is desired, then decision block 222 advances the method, via YES 
path 225 to block 228 which, in turn, sets an intensity of output pixel(i.j) to black, i.e., zero. Alternatively. « a gray-scale 
output is desired, then decision block 222 advances method 200, via NO path 224, to block 230. This latter block, when 
performed sets the intensity of output pixel(i.j) to the intensity of Uc, without any modifications made thereto. If, on the 
other hand, pixel intensity. lc. exceeds threshold IT, decision block 215 advances method 200. along YES path 218. to 
decision block 232. This latter decision block determines whether a user has previously instructed the method to produce 
a binary or gray-scale output pixel. If a binary pixel is desired, then decision block 232 advances the method, via YES 
path 235 to block 237 which, in turn, sets an intensity of output pixel(i.j) to white, i.e.. one. Alternatively, if a gray-scale 
output is desired, then decision block 232 advances method 200. via NO path 234, to block 241 . This latter block, when 
performed, sets the intensity of output pixel(i j) to the intensity of L^, without any modif ications made thereto. Once step 
228. 237 or 241 is performed, method 200 is completed tor pixel(ij); the method is then repeated for the next image 
pixel in succession, and so forth. . . . ^ 

Alternatively, if area gradient. GS(i.j) exceeds the threshold value. GT, then pixelrj j) lies in the vicinity of an edge. 
In this case, step 245 is now performed to ascertain the values of and L^. if these values have not already been 
determined as set forth above. Thereafter, method 200 advances to block 248 which calculates an average pixel intensity 
value, ^ . occurring within the N-by-N pixel window centered about pixel position (i.j). This average intensity value is 
simply determined by averaging the values L^* and 1^^. 

Once this average value has been ascertained, method 200 advances to decision block 251 which compares the 
intensity of pixel(i.j), i.e., against "its associated average intensity value, L^g. When this pixel is situated on a darker 
side of an edge and hence an object pixel, then its associated average intensity value will be greater than or equal to 
its intensity value. Lc. In this case, the output binary pixel for pixel position (i j) will be set to black. Otherwise, if pixel(ij) 
is situated near an edge but its average intensity value. L avg , would be less than its intensity value, Lc, then this pixel is 
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situatedonthe.ighter^^^ 

L avg .thendecisionbtock251 ^^^SS method to p Soce a binary or gray-sca. e output pixel. H a 
determines whether a user has Piously ^ructed I the P ^ 265 to block 268 ^j*. in turn, 

binary pixel is desired, then <^ »"* *° ^JSS2 » a gS-sca.e ou^ut is desired, then decision block 260 
sets an intensity ot output p.xel(..j) to black, te.. zero. However n a g y pert ormed. determines whether 

advances method 200. via NO path Wtod»»bM ^70. msla^r £ ^ ^ ^ ^ 
pixel intensity. U. exceeds the P r ^ d f sets the gray-sca.e outputtor 

,X then method 200 advances along NC the va.ue of threshold IT, then method 

£o1dL^ ^ sets the 9ray - sca,e output tor pae,(i,ft to 8 

value that is slightly less. I.e.. IT-, than threshold I IT. associated average intensity value. L^, then decision block 

Alternatively, it gray-scale p.xel.ntens.tyk exceeds , atter d * ision bl0 ck determines whether 
251 advances method 200. along YES path 2 53. to decwon js desired> 

a user has previously instructed I b £ S JS,. ^turn. sets an intensity of output 

then decision block 280 advances the method v« YES path 282 ^ methcxl 200. 

pixe.(i.j) to white, i.e.. one. However rf y^^^^SJiXl"^ ^her pixe. intensity. ^ is less 
via NO path 284. to decis.on block 290. This latter " P !™ eed8 fe ^ to threshold IT. then method 200 
than pre-defined threshold value IT IMMJ l^Jxe^^The grayscale output for P ixe.(ij) to the pixel 
advances, along NO path 292. to ^J^^^^SS^^^ 200advances. along YES path 
intensity value U- Now. H the ,ntens,tyof P^'^^JSS pi el(iJ) to a va.uethat is slightly larger, i.e.. IT*. 
294. to block 297. which, when execute* sets ta m «h ou*tf or P threshold IT is not critical 

r rS^^^c^eP^ KK * - -n performed for pixelflj); method 200 . 

then repealed for the next i^e pixel i ^"^^".SEaliy depicts intensity prof ile 400 of an illustrative scan 
With this understand.^ m m,nd. consider F G • 4 ^^^^ 10 shown in FIG. 1 . As a result of processing 
Une of an original grayscale ^J^^SS^S^m in FIG. 5. results. As is readily apparent 

profile 400 through our .nventive method mod** 'WP^ '» , ocated near an ^ w mo dHied. 

from comparing profiles 400 ^.^^S^-ST* edge and while its intensity is larger than threshold 
in this regard, when such a p,xel I . £«todm a ^ e ^ e s ^, y ,ess than threshold IT. Alternatively, when an image 
value IT. the intensity of the pixel .s decreased to a va uej I T . sngnn y tnresnold , T . the intensity of this pixel is 

As shown, circuit 600 is formed of l.ne 650 and 655; 7-by-7 Max and Min detector 635; 

gradient detector 615. sum of gradients -^^^SS^S^^ '^rter 664. and gates 662. 670 and 
averaging circuit 640; muttp exors ^^f^^^Z in FIG.. 2A and 2B and discussed in detail above. 
675. Circuit 600 collectively ^•^^^J*^"^ in parallel, via leads 605 and 608. to an input of 

,n operation, eight-brt input V*^™**™*™^^. Decays 610. specially series-connected single 
, ine delays 610 and to an ,nput of 7dy-7 Max and J^JJJJ^ imag e pixel data centered around image P ixel(ij). 
line delay elements 610,. 610 2 and 610 3 . prov.de a 3-by « 9 H g ^ a)| tnree 

Eachdefay element is tapped at three ^^J?f^S!£^^^ ca,CU,ati ° n * 8 * - 

delay elements are routed to respective .npute of gradient detert-on «™ (j j} on ou ^ ut leads 617 . This 

operator, as described above, provdes an 'ntensrty grad ent ^^^^^^^^M^yAm 
gradient value is applied to an input to sum of, grad ^^^^SvJIiirt sum. as an area gradient value, i.e.. 
Si a S-by-5 window centered about p.xe. porton M ^ PJ^*J^wS detail below. This area gradient value 
GS(ij). on 12-bH output leads 622. Crcu.t e»i « shown .nFIGJ and W» predefined gradient threshold. GT: 
is allied to an input ("A") of "mP"^ SJSSS^^i comparison output, appearing on 
the latter being applied to another I^^^^SSSSm^ or not. The comparison output is applied as an 

690 - as discussed in detaM be,ow - the appropriate 

SJSSSSSfi- one of the predefined values IT-or.T + . 
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Max and Min detector 635, depicted in FIG. 8 and discussed in detail below, ascertains the maximum and minimum 
pixel intensity values, i.e.. L max and L^, of those pixels contained within a 7-by-7 window centered about pixel(i,j). 
Resulting L and L min values, appearing on leads 637 and 639, are then averaged by averaging circuit 640. This circuit 
contains summer 642. which merely adds these two values together into a resultant sum. and 12 and delay circuit 646 
which irrplements a divide by two operation by shifting the sum one-bit to the right; hence dropping the least significant 
bit The resultant averaged intensity value is then appropriately delayed within circuit 646 for purposes of achieving 
proper synchronization within circuit 600. 

If the area gradient value. GS(ij), exceeds the pre-defined gradient threshold, GT, then the averaged intensity value. 
L used as a threshold level in determining the value of output center pixel(ij); else, if the area gradient value is less 
than or equal to the gradient threshold, then value IT is used as the threshold level in determining the value of output 
center pixelfi j). In this regard, the averaged intensity value, L avg , is applied to one input of comparator 650. The current 
center pixel value, Uc, is applied through delay element 630 to another input of comparator 650 as well as to one input 
of comparator 655. The delay of element 630 is set to assure that the proper corresponding values of and L^g are 
synchronously applied to collectively comparators 650 and 655, multiplexors 680 and 690. and gating circuit 660. The 
fixed threshold value, IT, is applied to another input of comparator 655. Comparator 650 produces a low or high or low 
level output on lead 653 if pixel intensity value Uc is less than, or equal to or greater than, respectively, its associated 
averaged intensity value, Le vg . This output level is applied to both one input ("1") of multiplexor 680 and one input of 
AND gate 662. Multiplexor 680 selects the output binary pixel as between two bits generated from using two different 
thresholds i.e., value IT or L avg . In that regard, comparator 655 produces a high output level at its A>B output whenever 
the predefined threshold value IT is greater than pixel intensity value 1^. The level at output A>B is routed, via leads 
657 to another input TO* 1 ) of multiplexor 680 and to one input of AND gate 670. Multiplexor 680 produces the output 
binary value for pixel(i.j). To do so, the comparison output level produced by comparator 625, indicative of whether the 
area gradient. GS(i.j). exceeds the gradient threshold, GT, is applied as a select signal, via lead 628. to a select fS") 
input of multiplexor 680. If this select signal is high, thereby indicating that the area gradient exceeds the gradient thresh- 
old, then multiplexor 680 routes the level applied to its "1 " input, i.e., that generated by comparator 650. to binary output 
lead 33- else, the multiplexor routes the level then appearing at its "0" input to lead 33. 

Multiplexor 690 routes the center pixel value. L^. or a pre-defined value IT+ or IT- as the output gray-scale value to 
eight-bit output leads 37. Pre-defined values IT+ and IT- along with center pixel value U are applied to different corre- 
sponding eight-bit inputs to multiplexor 690. The output of this multiplexor is determined by the state of two select signals, 
SO and SV H these signals assume the binary values zero, "1" or "S". then the multiplexor will route center pixel value 
i va(ue |j + or value IT-, respectively, to output leads 37. Gating circuit 660. in response to the comparison output 
sfanals produced by comparators 625, 650 and 655 generates, by simple combinatorial logic, the two select signals, SO 
and SI applied to multiplexor 690. Specifically, if the area gradient is not larger than the gradient threshold, then, as 
discussed above gray-scale pxel value Uc. without any modification, will be applied through multiplexor 690. to output 
leads 37 Alternatively, if the area gradient exceeds the gradient threshold, then multiplexor 690 will apply either value 
IT+ or IT- as a modified gray-scale output value to lead 37 based upon whether center pixel intensity value 1^ is less 
than its corresponding average pixel intensity value. L avg . or not. Since the operation of gating circuit 660 is serf-evident 
from the above description to anyone skilled in the art, it will not be discussed in any further detail. Values IT- and IT+ 
are merely held in appropriate registers (not shown) and from there applied to corresponding inputs of multiplexor 690. 

FIG 7 depicts a block diagram of Sum of Gradients Circuit 620 which forms part of circuit 600 shown in FIG. 6. 
Circuit 620 as shown in FIG. 7, is formed of adder 710 and line delays 720, the latter containing four series-connected 
single-line delay elements 720 0 . 720 1( 720 2 and 720 3 . Incoming gradient intensity values are applied, over lead 617 
from gradient detection circuit 615 (shown in FIG. 6), to one input of adder 710. shown in FIG. 7. In addition, the output 
of each of the four line delays 720 is routed over a different one of four feedback leads 715 to a different corresponding 
input of this adder. As a result, adder 710 forms partial sums of 1-by-5 gradient values which are applied, as input, to 
delay element 720 0 . To generate a 5-by-5 sum of the intensity gradients, hence producing an area gradient measure, 
four preceding corresponding 1 -by-5 sums are added together within adder 71 0 to a present 1 -by-5 sum. with the result- 
ant overall 5-by-5 sum being applied to output lead 622. Use of these delay elements connected in this fashion substan- 
tially reduces the amount of logic that would otherwise be needed to generate twenty-five intermediate sums. 

FIG. 8 depicts a block diagram of 7-by-7 Maximum and Minimum Detector 635 which also forms part of circuit 600 
shown in FIG. 6. As noted above, detector 635 determines the maximum and minimum pixel intensity values. and 
U, respectively, within a 7-by-7 window centered about pixel position (i.j). This detector is formed of two separate 
circuits- maximum detecting circuit 805 and minimum detecting circuit 840 which extract a pixel value having a maximum 
and minimum intensity value, respectively, of all the pixels contained within the 7-by-7 window centered around pixelfi j) 
and respectively apply those two values to leads 637 and 639. Circuit 805 contains maximum detector 810 and line 
delays 820, the latter being formed of six series-connected single-line delay elements 820 0 . 820i, 820 2 . 820 3 , 820 4 and 
820 5 Similarly, circuit 840 contains minimum detector 850 and line delays 860, the latter being formed of six series-, 
connected single-line delay elements 860 0 , 860^ 860 2 , 860 3 . 860 4 and 860 5 . Inasmuch as circuits 805 and 840 function 
is an identical manner with the exception that detector 81 0 locates maximum values while detector 850 locates a minimum 



8 



EPO 712 094 A2 



w 



is 



so 



. - hoth from the same series of pixel intensity values applied over lead 608, we will only discuss circuit 805 in detail; 

moTesk^^ Mnl hhh- tH 

those sk,nea in ine input ol maximum detector 810. In addition, the 

Incornmg pixel "JjJjJ" S^ouTed^ver a different one ofsix feedback leads to a respective input of this 
Tit rSTsSS "JS^cSSSfo extracts the maximum pixel intensrty value from a 1*y-7 series of pixe. 
detector. Consequent^ ™* d , , , ^ , n conjunction with six preced.ng corre- 

ValUe Jn a a mLTmum 820. maximum detector 810 locates the maximum pixel intensity 
sponding maximum pixene ues x vaiu ^ fc leads ^ Here ^ use ^ 

^«r^ 0 ini^rmf»diate maximum and minimum pixel intensity values. 
l ° 9 Ts one Sn no^ aviate, the quaHty of the resulting thresholded image produced by our . ration ,s controHed 
As one can now apv Threshold GT is set to properly classify image pixels located in the vicinity 

° 22 oixels Si image regions of uniform tone. i.e.. Hat" fiekte. .n that regard, a low IT value tends to classrty 
classify pixels locatea in ay a rP laiivelv hiah GT value is used, then our invention for all practical 

purposes P* rt ° rms J' 1 ™* are a Gradient thereby causing our invention to classify pixels by essentially solely com- 
^VT£'xe £ aTatnst fix^ t e hre y shold ,t 9 Our invention can also produce an outHne image :*rough 
SSe vJue of mrihoW IT to Jo. in which case all the image pixels wi.. be dassHied as white except for the edge 

UhlrJ ^^hei; associated area gradients are larger than the gradient threshed . 
P ' AHhouoh one Ser Id embodLnt of the present invention has been shown and described ,n detail herem many 
othefS erTodlments that incorporate the teachings of the invention may be easHy constructed by those ski..ed ,n 
the art 

. |twr -i.i app| .r.ARILITY AND ADVANTAGES 

^ .mention is useful in image scanning systems and particularly for thresholding scanned gray-scale 

6 P T nlLTne ^ P rovides more aCCura1e thresho,din S that prey "° US y 

Snltxet The^X en^ immunity ^background noise in the scanned image and substantially 

eliminating boundary artifacts from a thresholded image. 
Claims 

« a me ,h«i for thresholding an input gray-scale image into a first output image, wherein said input image is formed 

fe EtSi p^Ss each Ling a corresponding output pixe. value associated therewith, characterized .n that the 

^^TaS in SsrSnte to a plurality of pixel values in the input image, an area gradient value for a first 
wirKlow oi^efinS size, of intensJy gradient values associated with ones of the pixel values centered 
Tk™ rt a current Pixel position. (i.j) (where i and j are pixel indices), in the input image; 
*"* det mWng i ^response to a second window (330). of pre-defined size, of input pixels 
currem pte^sitfon <ij) in the input image, a maximum pixe. value and a minimum P .xel value of the pixel values 

contained SL va.ue and the maximum and minimum pixe. values, a current one of 

4S ES^'ST^SSSSm in the input image into an output pixe. va.ue at a corresponding 
position in the first output image. 
« Th^mMhoriof claim 1 wherein the area gradient ascertaining step comprises the steps of: 

2 ' meting in response to a third window (370). of the pixel values, of a predefined size and centered I abort 

, he cur^nt opposition (i.j) in the input image, an intensity gradient. Gf.J). value for the current pixel posrton such 
60 Jr^t to a suSesSn of input pixels in the input image, a plurality of corresponding intensity gradient values * 

^'"^rnfng the first window of intensity gradient values from the plurality of intensity gradient values and. in 
response to said first window, determining said area gradient value. 
55 , Th* method in claim 2 wherein said first window forming and area gradient determining step comprises the step of 
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The method in claim 3 wherein said thresholding step comprises the steps of: 

comparing the area gradient value. GS(i J), to a pre-defined gradient threshold value. GT. so as to produce 
a comparison signal indicative of whether a current one input pixel located at the current pixel position (ij) in the 
innut imaae lies in a vicinity of an edge in said input image: 

if the comparison signal indicates that said current one input pixel does not lie in the vicinity of the edge, 
thrpsholdina in response to said comparison signal, said input pixel value at position (i j) in the input image against 
a pre-defined fixed gray-scale threshold value. IT. to yield a corresponding binary output value at pixel position (i j) 

^he^mpa^son signal indicates that said one current input pixel does lie within the vicinity of an edge, 
settina in response to said comparison signal, said corresponding binary output value to a first or second state if 
the current onepixel value either exceeds or is less than an average of the maximum and minimum pixel values, 
respectively. 

The method in claims 3 or 4 wherein said intensity gradient measuring step comprises the step of producing the 
intensity gradient value for the current pixel position by processing said third window of input pixel values through 
a Sobel operator. 

The method in claim 5 wherein the third window is a 3-by-3 window of input pixel values and the Sobel operator 
implements the following equations: 

GX(ij) = L(i+1.j-1) + 2L(i+1 j) + L(i+1 j+1) - L(M j-1) - 2L(i-1.j) - L(i-1 j+1): 

GY(ij) = MM j+1) + 2L(ij+1) + L(i+1 j+1) - L(i-1 j-1) - 2L(ij-1) - L<i+1.j-1); 

and 

G(ij) = |GX(ij)| + |GY(ij)| 

where: L(i j) is an input pixel value, in luminance, for 
the pixel position (ij) in the input image. 

The method in claim 4 further comprising the steps of: 

producing in response to said comparison signal and the pixel value (Lc) at the current pixel position (i j). a 
modified arav-scale output pixel value at position (ij) in a second output image, wherein. H the comparison signal 
indicates that the one current input pixel does not lie in the vicinity of the edge in said input image, the modified 
nrav-scale output pixel value is set equal to the pixel value Lc. or if the comparison signal indicates that the current 
^ne incut pixel does lie in the vicinity of the edge, the modified gray-scale output pixel value is set equal to a pre- 
defined value greater than or less than the threshold value IT when the pixel value Lc is greater or less than, respec- 
tively the threshold value IT; and , 

thresholding the modified gray-scale output pixel value, against the threshold value IT. to yield a conesponding 
second binary output pixel value at a pixel position (i.j) within a second output image. 

ADDaratus for thresholding an input gray-scale image into a first output image, wherein said input image is formed 
ofinout pixels each having a multi-bit gray-scale pixel value associated therewith, and wherein said first output image 
is formed of pixels each having a corresponding output pixel value associated therewith, characterized in that the 
apparatus utilizes the steps recited in any of claims 1-7. 
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A multi-windowing technique lor threshholding an image using local image properties 



(54) 

(57) A technique, specifically apparatus and an 
accompanying method, for accurately thresholding an 
image based on local image properties, specifically lumi- 
nance variations, and particularly such a technique that 
uses multi-windowing for providing enhanced immunity 
to image noise and lessened boundary artifacts. Specif- 
ically a localized intensity gradient. G(i j). is determined 
for a' pre-defined window (300) centered about each 
image pixel(ij). Localized minimum and maximum pixel 
intensity measures. L™ and L^. respectively are also 
determined for another, though larger, window (330) cen- 
tered about pixel(ij). Also, a localized area gradient 
measure. GS(ij). is determined as a sum of individual 
intensity gradients tor a matrix of pixel positions (370) 
centered about pixel position (i j). Each image pixel(i.D is 
then classified as being an object pixel, i.e.. black, or a 
background pixel, i.e. white, based upon its area gradi- 
ent. GS(ij). and associated l™, and L^x measures. 
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